A rough set theory and deep learning-based predictive system for gender recognition using audio speech

نویسندگان

چکیده

Speech is one of the most delicate medium through which gender speakers can easily be identified. Though related research has shown very good progress in machine learning, but recently, deep learning imparted a area to explore deficiency discrimination using traditional techniques. In techniques, speech features are automatically generated by reinforcement from raw data have more discriminating power than human-generated features. But some practical situations like recognition, it observed that combination both types sometimes provides comparatively better performance. proposed work, we initially extracted and selected informative precise acoustic relevant recognition entropy-based information theory Rough Set Theory (RST). Next, audio signals directly fed into neural network model consisting Convolution Neural Network (CNN) Gated Recurrent Unit (GRUN) for extracting useful recognition. The RST selects features, CNN extracts locally encoded important GRUN reduces vanishing gradient exploding problems. Finally, hybrid system developed combining feature vectors. been tested with five bench mark simulated dataset evaluate its performance, combined vector effective specially when transgender considered as type together male female.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)

Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...

متن کامل

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

متن کامل

Open-Domain Audio-Visual Speech Recognition: A Deep Learning Approach

Automatic speech recognition (ASR) on video data naturally has access to two modalities: audio and video. In previous work, audio-visual ASR, which leverages visual features to help ASR, has been explored on restricted domains of videos. This paper aims to extend this idea to open-domain videos, for example videos uploaded to YouTube. We achieve this by adopting a unified deep learning approach...

متن کامل

A system for audio-visual speech recognition

In this work, a system of audio visual speech recognition will be presented. A new hybrid visual feature combination, which is suitable for audio -visual speech recognition was implemented. The features comprise both the shape and the appearance of lips, the dimensional reduction is applied using discrete cosine transform (DCT). A large visual speech database of the German language has been ass...

متن کامل

A Novel Emotion Recognition Method Based on Ensemble Learning and Rough Set Theory

Emotion recognition is a very hot topic, which is related with computer science, psychology, artificial intelligence, etc. It is always performed on facial or audio information with classical method such as ANN, fuzzy set, SVM, HMM, etc. Ensemble learning theory is a novelty in machine learning and ensemble method is proved an effective pattern recognition method. In this paper, a novel ensembl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Soft Computing

سال: 2022

ISSN: ['1433-7479', '1432-7643']

DOI: https://doi.org/10.1007/s00500-022-07074-z